1. INTRODUCTION


Label Identification from Statistical Tabulation (LIST) is an analyst's
picture element (pixel) labeling procedure for making at-harvest small-grain
proportion estimates in the Large Area Crop Inventory Experiment (LACIE). In
this labeling procedure, the analyst-interpreter (AI) is required to answer
questions about the segment and pixels which relate to simple properties that
discriminate small grains from nonsmall grains. The responses, along with
pertinent agricultural and meteorological variables, are statistically weighted
to develop a discriminant function which is trained on blind site ground-truth
labels.

The preliminary development of LIST was analyzed and reported by Pore (ref. 1);
from this, a semiautomated operational LIST was developed by Abotteen and
Pore (ref. 2) and tested on blind sites in both Kansas and North Dakota
(ref. 3). Improvements in the green number and brightness keys used in LIST
were made by Dennis and Pore (ref. 4), and the resulting discriminant was
tested on LACIE Phase III Kansas data. While testing this discriminant
on LACIE Phase III data, an effort was made to use these data to produce
alternate keys which would outperform greenness and brightness. Keys were
developed for channel ratios and normalized channel rankings, which are now
under study for use in spectral yield modeling. These new spectral keys are
described in section 2. In section 3, the results obtained using these keys
alone and using them in conjunction with the greenness and brightness keys are
presented, and the interpretations of the results are discussed. Conclusions
and recommendations are summarized in section 4.

The earlier LIST studies (ref. 1 through ref. 4) supported development of a
semiautomated labeling technique that is now being tested operationally.
This study, however, is a research and development study into the possible
benefit of implementing an alternate set of spectral keys in the LIST
discrimination process.
2. DEVELOPMENT


Cate (ref. 5) and Hay (ref. 6) proposed that the ratio of the land satellite
(Landsat) Channel 4 to Channel 2 be used as a possible alternative to greenness
as a spectral aid. This ratio and other combinations of spectral data were
considered as possible replacements for or improvements to the greenness and
brightness keys used in LIST. The specific ratios considered were:

a. Channel 4 to Channel 2 

b. Channel 3 plus Channel 4 to Channel 1 plus Channel 2 

c. One-half of the sum of Channel 1 and Channel 2 

d. Channel 1 to Channel 2 

e. Channel 4 to Channel 3 

In addition to these ratios, another proposal by Cate (ref. 5) was considered.
This proposal is a key based on the relative ranking of the four Landsat
channels. The new keys were developed from the LACIE Phase III Kansas blind
site data, and all of the blind sites which had four usable acquisitions were
used. Listed in table 2-1 are the segments and acquisitions used. The
individual channels were first normalized using the method outlined by Cate
(ref. 5), and then ratios were obtained. These normalized ratios were
standardized as a function of the Julian date and Robertson biostage. This
standardization, not to be confused with the normalization procedure above, is
the same procedure that was applied to the green number and brightness
transformations described in ref. 4; however, in this case, the LACIE Phase III
data were used to produce means and standard deviations of the ratios for each
satellite sweep.
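As a rough illustration of the two steps just described, the sketch below
forms two of the channel ratios from channel values that are assumed to be
normalized already, and then standardizes one ratio against the mean and
standard deviation of its own acquisition. The function names, the sample
values, and the plain z-score form are assumptions made for illustration; the
normalization of Cate (ref. 5) and the exact standardization of ref. 4 are not
reproduced here.

```python
from statistics import mean, pstdev


def channel_ratios(ch1, ch2, ch3, ch4):
    """Return ratio a (Channel 4 / Channel 2) and ratio b ((3 + 4) / (1 + 2))."""
    return ch4 / ch2, (ch3 + ch4) / (ch1 + ch2)


def standardize(values):
    """Z-score each value against the mean and standard deviation of its
    acquisition (an assumed form of the standardization)."""
    mu = mean(values)
    sigma = pstdev(values)
    return [(v - mu) / sigma for v in values]


# Hypothetical normalized channel values (ch1, ch2, ch3, ch4) for three
# pixels taken from a single acquisition.
pixels = [
    (20.0, 10.0, 30.0, 30.0),
    (20.0, 20.0, 20.0, 20.0),
    (30.0, 30.0, 15.0, 15.0),
]
ratio_a = [channel_ratios(*p)[0] for p in pixels]
z_a = standardize(ratio_a)
```

In this sketch each acquisition would be standardized separately, so that a
given standardized value means the same thing early and late in the season.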

TABLE 2-1.- DATA USED IN THE DEVELOPMENT

Segment    Acquisitions
1851       6289, 6361, 7067, 7193
1340       6287, 6305, 7101, 7191
1175       6287, 6305, 7101, 7191
1881       6288, 6324, 7066, 7156
1885       6287, 7101, 7155, 7191
1279       6290, 7050, 7158, 7194
1285       6272, 7086, 7158, 7194
1033       6288, 6325, 7084, 7156
1890       6288, 6360, 7066, 7156
1183       7063, 7082, 7099, 7154
1021       6255, 6363, 7159, 7194
1295       6288, 6306, 7066, 7156
1864       6326, 7050, 7122, 7194
1170       6287, 6305, 7101, 7191
1346       6287, 6305, 6323, 7101
1032       7068, 7086, 7158, 7194
1853       6253, 6361, 7067, 7193
1861       6326, 7104, 7158, 7194
1180       6285, 7081, 7099, 7153
1158       6287, 6323, 7101, 7155
1166       7046, 7082, 7154, 7190

In table 2-2, the means and standard deviations for both small grains and
nonsmall grains obtained for each ratio are listed. Several of these ratios
were immediately abandoned for two reasons. First, for each variable, LIST
uses four standardized distances, their absolute values, and two trajectory
comparisons, which give a total of ten new dimensions; for five ratios, the
number of dimensions (50) would be too large. Second, several of the ratios
showed little or no separation between the means and standard deviations of
grains and nongrains. Therefore, it was decided to limit the ratios used in the
testing to the following:

a. Channel 4 to Channel 2 

b. Channel 3 plus Channel 4 to Channel 1 plus Channel 2 

In addition to testing the two ratios, the ranking of the four normalized
channels was studied. For example, if the reflectance in Channel 4 was
greater than the reflectance in Channel 3, which was greater than that in
Channel 1, which was greater than that in Channel 2, then there was a better
chance that the sample represented healthy vegetation than if some other
ranking of channels was observed. In order to develop the channel rankings into
a quantitative measure, a rather complicated transformation of the rankings
was generated using empirical data to indicate the likelihood that a pixel
with a given ranking is a small-grains pixel on a particular pass. A
description of this transformation of the rankings follows.
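As a preliminary to that description, the ranking step itself can be sketched
as follows. The sketch assumes that the ranking symbols of table 2-3 run from
A through X over the 24 possible orderings of the four channels in
lexicographic sequence, and that a tie is resolved in favor of the
lower-numbered channel. Both points are readings of table 2-3 rather than
statements made in the text, and the function name is hypothetical.

```python
from itertools import permutations
from string import ascii_uppercase

# The 24 orderings of channels 1-4 in lexicographic order, assumed to
# correspond to the symbols A through X.
ORDERINGS = {
    perm: ascii_uppercase[i]
    for i, perm in enumerate(permutations((1, 2, 3, 4)))
}


def ranking_symbol(ch1, ch2, ch3, ch4):
    """Return the symbol for the descending order of the four channel values."""
    values = {1: ch1, 2: ch2, 3: ch3, 4: ch4}
    # Sort by descending value; on a tie the lower channel number comes first.
    order = tuple(sorted(values, key=lambda ch: (-values[ch], ch)))
    return ORDERINGS[order]


# The healthy-vegetation example from the text: Channel 4 > 3 > 1 > 2.
symbol = ranking_symbol(10, 5, 20, 30)
```

Under these assumptions the example ordering maps to the symbol W, and four
equal channel values map to A.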

Frequency tables were constructed to show which rankings were indicative of
grains at various times of the growing season and which rankings were more
indicative of nongrains. The rankings that were observed to favor grains
and nongrains at various times and the percentages of grains and nongrains
observed in each class are listed in table 2-3. For each acquisition, a
pixel was assigned one of three values (0 for the neutral case), depending on
whether the ranking was favorable for grains, favorable for nongrains, or
neutral. These four values obtained from table 2-3 and their sum, together
with the variables obtained from the above ratios, were studied as potential
discriminant variables.
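The scoring just described can be sketched as follows. The values +1 and -1
are assumptions (only the 0 survives in the text), and the favorable and
unfavorable symbol sets shown are hypothetical placeholders, not the entries
of table 2-3.

```python
def ranking_score(symbol, favorable, unfavorable):
    """Score one acquisition: +1 if the ranking favors grains, -1 if it
    favors nongrains, 0 if neutral (the +1 and -1 values are assumed)."""
    if symbol in favorable:
        return 1
    if symbol in unfavorable:
        return -1
    return 0


def ranking_variables(symbols, keys):
    """Return the four per-acquisition scores followed by their sum."""
    scores = [
        ranking_score(s, favorable, unfavorable)
        for s, (favorable, unfavorable) in zip(symbols, keys)
    ]
    return scores + [sum(scores)]


# Hypothetical (favorable, unfavorable) symbol sets, one pair per acquisition.
keys = [({"X", "W"}, {"A", "B"})] * 4
variables = ranking_variables(["X", "A", "M", "W"], keys)
```

Here the pixel yields the five candidate variables 1, -1, 0, 1 and their sum 1.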

The discriminant variables were then analyzed using a stepwise discrimination
process to evaluate the discriminability of the ensemble of variables. The
objective was to derive the subset of variables that yielded optimum
discriminability.
TABLE 2-3.- OBSERVED CHANNEL RANKINGS

Columns: Acquisition; Rankings favoring grains; Percentage of grain and other
samples in favorable category; Rankings not favoring grains; Percentage of
grain and other samples in unfavorable category

Surviving ranking-symbol entries (only partly legible in the source):

6. H. J. K. L. f
5. H. 1. J. L. 0. P, a. U. I
a. C. 6. H. I. J. P
a. 6. H. 1. J. U
6. H. I
6. H, I
a. 6

Symbol   Ranking of channels     Symbol   Ranking of channels
A        1 ≥ 2 ≥ 3 ≥ 4           M        3 > 1 ≥ 2 ≥ 4
B        1 ≥ 2 ≥ 4 > 3           N        3 > 1 ≥ 4 > 2
C        1 ≥ 3 > 2 ≥ 4           O        3 > 2 > 1 ≥ 4
D        1 ≥ 3 ≥ 4 > 2           P        3 > 2 ≥ 4 > 1
E        1 ≥ 4 > 2 ≥ 3           Q        3 ≥ 4 > 1 ≥ 2
F        1 ≥ 4 > 3 > 2           R        3 ≥ 4 > 2 > 1
G        2 > 1 ≥ 3 ≥ 4           S        4 > 1 ≥ 2 ≥ 3
H        2 > 1 ≥ 4 > 3           T        4 > 1 ≥ 3 > 2
I        2 ≥ 3 > 1 ≥ 4           U        4 > 2 > 1 ≥ 3
J        2 ≥ 3 ≥ 4 > 1           V        4 > 2 ≥ 3 > 1
K        2 ≥ 4 > 1 ≥ 3           W        4 > 3 > 1 ≥ 2
L        2 ≥ 4 > 3 > 1           X        4 > 3 > 2 > 1


3. RESULTS AND INTERPRETATIONS 


3.1 RESULTS

Stepwise and direct linear discriminant analyses were performed on the LACIE
Phase III Kansas blind site data. Both analyses were performed using the
Statistical Package for the Social Sciences (SPSS) described in reference 7.
The stepwise analyses were used primarily to indicate which variables were more
useful in the discrimination process and which variables could possibly be
removed for a later analysis. The direct analyses were used to compare
training accuracies for various sets of discriminant variables. Table 3-1
illustrates the format used for tables 3-2 through 3-6, which show the results
of these analyses. This format is the same as that used for tables in
references 3 and 4.
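For readers unfamiliar with the technique, the sketch below shows a direct
two-class linear discriminant (Fisher's form with a pooled covariance matrix)
on two variables. It is not the SPSS procedure used in this study; the
function names, the restriction to two variables, the midpoint threshold
(which assumes equal priors), and the sample data are all assumptions made for
illustration.

```python
def class_mean(rows):
    """Return the mean of each of the two variables."""
    n = len(rows)
    return [sum(r[0] for r in rows) / n, sum(r[1] for r in rows) / n]


def scatter(rows, mu):
    """Return the 2x2 within-class scatter matrix as (sxx, sxy, syy)."""
    sxx = sum((r[0] - mu[0]) ** 2 for r in rows)
    sxy = sum((r[0] - mu[0]) * (r[1] - mu[1]) for r in rows)
    syy = sum((r[1] - mu[1]) ** 2 for r in rows)
    return sxx, sxy, syy


def fit_discriminant(grains, others):
    """Return weights w and threshold t; a pixel x is labeled small grains
    when w . x > t."""
    mg, mo = class_mean(grains), class_mean(others)
    gxx, gxy, gyy = scatter(grains, mg)
    oxx, oxy, oyy = scatter(others, mo)
    dof = len(grains) + len(others) - 2
    sxx, sxy, syy = (gxx + oxx) / dof, (gxy + oxy) / dof, (gyy + oyy) / dof
    det = sxx * syy - sxy * sxy
    dx, dy = mg[0] - mo[0], mg[1] - mo[1]
    # w = S^-1 (mean_grains - mean_others) for the pooled covariance S.
    w = [(syy * dx - sxy * dy) / det, (sxx * dy - sxy * dx) / det]
    mid = [(mg[0] + mo[0]) / 2, (mg[1] + mo[1]) / 2]
    return w, w[0] * mid[0] + w[1] * mid[1]


def label(x, w, t):
    """Label one pixel from its two discriminant variables."""
    return "small grains" if w[0] * x[0] + w[1] * x[1] > t else "other"


# Hypothetical training pixels (two discriminant variables each).
grains = [(2.0, 1.0), (3.0, 1.5), (2.5, 0.5), (3.5, 1.0)]
others = [(0.0, 0.0), (1.0, 0.5), (0.5, -0.5), (1.5, 0.0)]
w, t = fit_discriminant(grains, others)
```

A stepwise analysis would repeat a fit of this kind while adding or removing
one variable at a time; that selection loop is not shown.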

The best accuracy obtained, using only subsets of the new keys and the five
AI canopy keys, is presented in table 3-2. The overall accuracy here was
significantly lower than accuracies observed in previous studies (ref. 4),
and attempts to improve accuracy were made by including the greenness and
brightness keys. The accuracy obtained using ratio variables, channel
rankings, AI canopy keys, and greenness and brightness keys is shown in
table 3-3. The total number of variables involved, however, makes the use of
all four sets of keys impractical.

The stepwise portion of this analysis indicated that using both ratios was
providing redundant information and that it might be possible to drop the
4 to 2 ratio. Table 3-4 illustrates that the same overall accuracy (PCL) was
obtained when this ratio was dropped from the analysis. However, the total
number of variables was still too great for practical use.

In table 3-5, the accuracy obtained when the (3 + 4)/(1 + 2) ratio was dropped
is shown. Only the channel rankings, AI canopy keys, and greenness and
brightness keys remained in the analysis. In table 3-6, the results obtained
using only greenness and brightness and AI canopy keys are given. These are
the variables that were tested on the LACIE Phase II Kansas data and are
currently in use in LIST. It is interesting to note the difference in
accuracies on LACIE Phase II data (ref. 4) and LACIE Phase III data. In fact,
all accuracies for the LACIE Phase III data are significantly lower than those
observed for LACIE Phase II data. An explanation is given in section 3.2.

TABLE 3-1.- CONTINGENCY TABLE KEY

                       Direct discriminant
Ground truth           Small grains   Other          Conditional percent
Small grains           a              b + e          g
Other                  c              d + f          h
Conditional percent    i              j

PCL =                  Total = k
GSG =
DSG =

Symbol definitions:

a, b, c, and d = raw pixel counts for the four test segments.

e and f = raw pixel counts for the designated "other pixels."

Total = k = a + b + c + d + e + f.

g, h, i, and j = marginal probabilities (P), expressed as percentages, of
correct labeling:

g = [a / (a + b + e)] x 100 = (1 - P(omission)) x 100

h = [(d + f) / (c + d + f)] x 100 = (1 - P(commission)) x 100

i = [a / (a + c)] x 100

j = [(d + f) / (b + d + e + f)] x 100

GSG = [(a + b + e) / k] x 100 = the ground-truth percentage of small grains

DSG = [(a + c) / k] x 100 = the discriminant-labeled percentage of small grains

PCL = [(a + d + f) / k] x 100 = the probability (expressed as a percentage) of
correct labeling

TABLE 3-2.- RESULTS OBTAINED USING AI CANOPY KEYS TOGETHER WITH KEYS
DEVELOPED FROM NEW RATIOS AND CHANNEL RANKINGS

                       Direct discriminant
Ground truth           Small grains   Other          Conditional percent
Small grains           714            156 + 80       75.2
Other                  169            1054 + 1232    93.1
Conditional percent    80.9           90.6

PCL = 88.1 percent     Total = 3405
GSG = 27.9 percent
DSG = 25.9 percent

TABLE 3-3.- RESULTS OBTAINED USING AI CANOPY KEYS, GREENNESS AND
BRIGHTNESS KEYS, AND NEW RATIO AND CHANNEL RANKING KEYS

                       Direct discriminant
Ground truth           Small grains   Other          Conditional percent
Small grains           737            133 + 80       77.6
Other                  155            1068 + 1232    93.7
Conditional percent    82.6           91.5

PCL = 89.2 percent     Total = 3405
GSG = 27.9 percent
DSG = 26.2 percent

TABLE 3-4.- RESULTS OBTAINED USING AI CANOPY KEYS,
GREENNESS AND BRIGHTNESS KEYS, AND NEW RATIO AND
CHANNEL RANKING KEYS WITH THE 4:2 RATIO REMOVED

                       Direct discriminant
Ground truth           Small grains   Other          Conditional percent
Small grains           732            138 + 80       77.1
Other                  150            1073 + 1232    93.9
Conditional percent    83.0           91.4

PCL = 89.2 percent     Total = 3405
GSG = 27.9 percent
DSG = 25.9 percent

TABLE 3-5.- RESULTS OBTAINED USING AI CANOPY KEYS,
GREENNESS AND BRIGHTNESS KEYS, AND CHANNEL RANKING KEYS

                       Direct discriminant
Ground truth           Small grains   Other          Conditional percent
Small grains           730            140 + 80       76.8
Other                  155            1068 + 1232    93.7
Conditional percent    82.5           90.6

PCL = 89.0 percent     Total = 3405
GSG = 27.9 percent
DSG = 26.0 percent

TABLE 3-6.- RESULTS OBTAINED USING ONLY AI CANOPY KEYS
AND GREENNESS AND BRIGHTNESS KEYS

                       Direct discriminant
Ground truth           Small grains   Other          Conditional percent
Small grains           680            190 + 80       71.6
Other                  186            1037 + 1232    92.4
Conditional percent    78.5           90.6

PCL = 86.6 percent     Total = 3405
GSG = 27.9 percent
DSG = 25.4 percent
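The contingency-table quantities of table 3-1 can be computed directly from
the six raw pixel counts. The sketch below does so using a reading of the
table 3-1 formulas that reproduces the percentages printed in table 3-4; the
function name and the dictionary layout are assumptions made for illustration.

```python
def contingency_metrics(a, b, c, d, e, f):
    """Return the table 3-1 percentages from the raw pixel counts."""
    k = a + b + c + d + e + f
    return {
        "g": 100 * a / (a + b + e),            # ground-truth small grains labeled correctly
        "h": 100 * (d + f) / (c + d + f),      # ground-truth other labeled correctly
        "i": 100 * a / (a + c),                # discriminant small grains that are correct
        "j": 100 * (d + f) / (b + d + e + f),  # discriminant other that are correct
        "GSG": 100 * (a + b + e) / k,          # ground-truth percentage of small grains
        "DSG": 100 * (a + c) / k,              # discriminant-labeled percentage of small grains
        "PCL": 100 * (a + d + f) / k,          # probability of correct labeling
        "Total": k,
    }


# Raw counts taken from table 3-4.
m = contingency_metrics(a=732, b=138, c=150, d=1073, e=80, f=1232)
rounded = {name: round(value, 1) for name, value in m.items()}
```

For the table 3-4 counts this gives g = 77.1, h = 93.9, i = 83.0, j = 91.4,
GSG = 27.9, DSG = 25.9, and PCL = 89.2 percent on a total of 3405 pixels.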

3.2 INTERPRETATIONS 

The difference in accuracies of the greenness and brightness keys for the 
LACIE Phase II and Phase III data can be partially explained by the fact that 
these keys were built using LACIE Phase II data and were, therefore, more
efficient at explaining Phase II data. This must also explain the difference 
between the results for greenness and brightness keys and the ratio keys. 
Because the ratio keys and spectral ranking keys were built using LACIE 
Phase III data, they should improve the accuracy of greenness and brightness 
keys on that data. In view of this, the difference between the greenness and 
brightness accuracy and the new variables accuracy is not considered 
significant. 
4. CONCLUSIONS AND RECOMMENDATIONS


This study indicates that the particular channel ratios studied (section 2: 
a, b, c, d, and e) are not superior to greenness and brightness for discrim- 
inating small grains in Kansas. Furthermore, this study demonstrates that 
the normalization and ranking system described in section 2, table 2-3 does 
contribute to the discrimination of small grains from other. However, this 
was discovered in a developmental mode of analysis and has not been tested 
on an independent test set. 

It is recommended that greenness and brightness be continued in LIST
discrimination (in lieu of channel ratios) and, further, that the ranking
system be tested for possible use in small-grains discrimination.